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ABSTRACT 

The Hi Parkes All-Sky Survey (HiPASS) Catalogue forms the largest uniform cata- 
logue of Hi sources compiled to date, with 4,315 sources identified purely by their Hi 
content. The catalogue data comprise the southern region 5 < +2° of HiPASS, the 
first blind Hi survey to cover the entire southern sky. RMS noise for this survey is 13 
mJy beam^^ and the velocity range is —1,280 to 12,700 kms^^. Data search, verifica- 
tion and parametrization methods are discussed along with a description of measured 
quantities. Full catalogue data are made available to the astronomical community in- 
cluding positions, velocities, velocity widths, integrated fluxes and peak flux densities. 
Also available are on-sky moment maps, position-velocity moment maps and spectra 
of catalogue sources. A number of local large-scale features are observed in the space 
distribution of sources including the Super-Galactic plane and the Local Void. Notably, 
large-scale structure is seen at low Galactic latitudes, a region normally obscured at 
optical wavelengths. 

Key words: methods: observational - surveys - catalogues - radio lines: galaxies 
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Table 1. Parameters of existing blind Hi surveys. 5v is the velocity resolution. TVJJ™ is the limiting Hi column density for gas filling 
the beam. Column density limits are 5cr and calculated with AV = 100km s^^, scaling RMS appropriately according to the velocity 
resolution. SCC, BGC and HiCAT all use HiPASS data. The SCC sample searches <5 < —62° and the BGC searches 5 < 0° and Sp > 116 
mJy. The references cited are as follows: ^Shostak (1977), ^Krumm & Brosch (1984), "'Kerr & Henning (1987), Henning (1992), ''Sorar 
(1994),Zwaan et al. (1997),Spitzak & Schneider (1998), '^Rosenberg & Schneider (2000), '^Henning et al. (2000), **Kilborn ct al. (2002), 
Braun, Thilker & Walterbos (2003), ^^Lang et al. (2003), "Koribalski et al. (2004), ^^ this paper. 



Name 


Area 


Beam Size 


Velocity 


Sv 


RMS 


ATlim 
^^ HI 


Telescope 


Sources 




(dcg2) 


(arcniin) 


(krns-l) 


(kms-i 


) (mJy 
bcarn^i 


(cm-2) 

) 






Shostak^ 


85, 
70, 
11 


10.8 


~-800 to ~2,835 


11 


32, 
44, 
18 


1.6x101'^, 
2.3x101^5, 
9.2xl0i» 


91m Green Bank 


1" 


Krumm & 


~35, 


10.8 


6,300 to 9,600, 


~45 


22 


2.2xl0i9 


91m Green Bank 





Brosch^ 


~44 




5,300 to 8,500 












Henning-* 


7,204 
pointings 


10.8 


-400 to 7,500 


22 


- 


- 


91m Green Bank 


37 


AHISS* 


65 


3.3 


<7,400 


16 


0.75 


4.3xl0i8 


305m Arecibo 


66 


Slice Survey^ 


55 


3.3 


100 to 8,340 


16 


1.7 


9.6x101* 


305m Arecibo 


75 


ADBS** 


430 


3.3 


<7,980 


34 


3.5 


2.8xl0i^ 


305m Arecibo 


265 


HIZSS^ 


1,840 


15.5 


-1,200 to 12,700 


27 


15 


5.0xl0i» 


64m Parkes 


110 


scc*§ 


~2400 


15.5 


-1,280 to 12,700 


18 


13 


4.0x101* 


64m Parkes 


536 


WRST Wide 


1,800 


49 


-1,000 to 6,500 


17 


18 


4.8x1017 


Westcrbork Array 


155 


Field Survey^ 


















HIJASS106 


1,115 


12 


-3,500 to 10,000 


18 


16 


7.3xl0i« 


76m Jodrell Bank 


222 


BGC"§ 


20,626 


15.5 


-1,200 to 8,000 


18 


13 


1.7x101'^'' 


64m Parkes 


1,000 


HiCAT 12 § 


21,346 


15.5 


300 to 12,700 


18'= 


13 


4.0x101** 


64m Parkes 


4,315 



"Not certainly extragalactic. 

''Survey in progress. 

''Data velocity resolution. Parametrization 

''Based on catalogue selection limits rather 



was carried out with additional 
than 5(T survey limit. 



Hanning smoothing, giving a velocity resolution of 26.4 km s 



1 INTRODUCTION 

The neutral hydrogen (Hi) content of galaxies provides a 
unique and fundamental perspective into the nature and 
evolution of galaxies, being intimately linked to both the 
dark matter component of galaxies along with their ability 
to form stars. 

Surveys for Hi in galaxies, however, are limited by the 
relative weakness of Hi photons. Despite a cosmological mass 
density ratio of Hi to luminous matter of only l:10,i the im- 
probability of the Hi hyperfine transition and weakness of 
photons emitted mean that the received power ratio favours 
optical photons by many orders of magnitude. In combi- 
nation with observational limitations, this has traditionally 
necessitated either blind surveys of relatively small volumes 
(eg. Shostak 1977, Krumm & Brosch 1984, Kerr & Hen- 
ning 1987, Henning 1992, Sorar 1994, Zwaan et al. 1997, 
Spitzak & Schneider 1998, Rosenberg & Schneider 2000), 
or targeted surveys across larger regions through the use 
of existing, typically optical, catalogues (eg. Fisher & TuUy 
1981, Mathewson et al. 1992, Giovanelh et al. 1997, Haynes 
et al. 1999). Parameters for existing blind Hi surveys are 
summarized in Table 1. 

With the advent of Multibeam receivers on the Parkes 
and Jodrell Bank radio telescopes, along with the forthcom- 
ing ALFA instrument at Arecibo, large untargeted surveys 
are now practicable for the first time. In this work we discuss 
results from the Hi Parkes All-Sky Survey (HiPASS), a bhnd 



1 f2Hl = (3.8 ± 0.6) X lO-l/ifgl (Zwaan et al. 2003) cf. ^s = 
(39 ± 5.7) X lO^'l/i^g (Salpeter initial mass function. Cole et al. 
2001) 



survey of the entire sky 5 < -1-25°. Various extragalactic 
catalogue subsets from this survey have already been com- 
pleted including: Kilborn et al. (2002, 5 < —62); Koribalski 
et al. (2004, 5 < 0° and Sp > 116 mJy) and companion 
analysis papers Zwaan et al. (2003, Hi mass function) and 
Ryan- Weber et al. (2002, newly catalogued galaxies); Waugh 
et al. (2002, Fornax region); and Banks et al. (1999, Centau- 
rus A group). Presented here are results from the southern 
region of the survey {5 < -1-2°); these data were searched for 
extragalactic sources to full survey depth. 

A complete database of Hl-selected galaxies is im- 
portant for a number of important astrophysical studies. 
Amongst these is a measurement of the Hi cosmic mass den- 
sity, f^Hi. At higher redshifts, Qui can be measured through 
damped Lya systems, however at z=0, the only way to ac- 
curately measure flni is through large-scale surveys for Hi 
emission. Galaxies under-represented in optical surveys such 
as low surface brightness galaxies, those with elusive optical 
counterparts and Malin 1 type systems (Bothun et al. 1987) 
are also best detected via blind Hi surveys. Gaining an ac- 
curate census of such objects is important for constraining 
models of galaxy formation, although not contributing a sig- 
nificant fraction of the overall gas mass density (Zwaan et al. 
2003). 

Other important applications for a large uniform cata- 
logue of Hi sources include the measurement of large-scale 
structure, 2-point correlation functions, the TuUy-Fisher re- 
lation, the role of hydrogen gas in the evolution of galaxy 
groups and the study of environments around Hl-rich galax- 
ies compared to those of optically-selected galaxies. 

In Section 2 we discuss the observational methods, 
data processing and basic properties of HiPASS. Methods 
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used to compile the source database are described in detail 
in Section 3, including the identification, verification and 
parametrization of sources, confused- and extended-source 
processing, the effects of radio-frequency interference (RFI) 
and recombination lines, and follow-up observations. Access 
to catalogue data is described in Section 4. Finally, basic 
property distributions are presented in Section 5 and the 
large-scale distribution of galaxies in Section 6. Reliability, 
completeness and accuracy of catalogue parameters are dis- 
cussed in the companion paper by Zwaan et al. (2004, Pa- 
per II). The identification of optical counterparts to HiPASS 
Catalogue galaxies is made in Drinkwater et al. (2004, Paper 
III). 



2 THE HI PARKES ALL-SKY SURVEY 

HiPASS was completed at the Parkes^ 64m radio telescope 
using the Multibeam receiver, an hexagonal array of 13 cir- 
cular feed-horns installed at the focal plane of the telescope. 
Observations and data reduction for this survey are de- 
scribed in detail in Barnes et al. (2001) and Staveley-Smith 
et al. (1996). A brief summary is provided below. 



2.1 Observations 

HiPASS observations were carried out from February 1997 
to March 2000 for the southern portion of the survey 
{5 < +2°) and to December 2001 for the northern ex- 
tension {S < +25°). For these observations, the telescope 
was scanned across the sky in 8° strips at a rate of 1° per 
minute, with scans separated by 7 arcmin in RA, giving a 
total effective integration time of 450s beam~^. To obtain 
spectra, the Multibeam correlator was used with a 64 MHz 
bandwidth and 1024 channel configuration, giving a velocity 
range of —1,280 to 12,700 kms^^and a channel separation 
of 13.2 kms^^ at z = 0. Parameters for the survey are sum- 
marized in Table 2. 



2.2 Data Processing 

Initial processing of HiPASS data was completed in real-time 
at the observatory. To correct spectra for the standard band- 
pass effects, spectra were reduced using the package live- 
data. This package also performed the conversion to helio- 
centric rest frame velocities by shifting spectra using Fourier 
techniques. 

Bandpass correction is done by dividing the signal tar- 
get spectrum by a reference off-source spectrum, this spec- 
trum representing the underlying spectral shape caused by 
signal filtering, as well as the temperature of the receivers, 
ground and sky. For the HiPASS data, the reference spectrum 
for each receiver is obtained by taking signals immediately 
before and after a particular integration as the telescope 
scans across the sky. To provide a robust measure of the 
bandpass, the value for each channel in the reference spec- 
trum is taken to be the median rather than the mean of 



^ The Parkes telescope is part of the Australia Telescope which 
is funded by the Commonwealth of Australia for operation as a 
National Facility managed by CSIRO 



Table 2. HiPASS parameters 



Sky Coverage 
Integration time per beam 
Average FWHM 
Gridded FWHM 
Pixel size 
Velocity Range 
Channel separation^ 
Velocity Resolution 
Positional accuracy-f 
RMS noise 



S < +25°* 

450s 

14.3 arcmin 

15.5 arcmin 

4 arcmin 

-1,280 to 12,700 kms^l 

13.2 kms^i 

18.0 kms^i 

1.5 arcmin 

13 mjy beam^^ 



* The HiPASS Catalogue covers southern regions 5 < +2° 

t at 2 = 

■f la accuracy at 99% completeness flux limit 

(Zwaan et al. 2004) 




Galactic HI emission 



Figure 1. Orthogonal projection of HIPASS cube 93 showing 
emission from the Milky Way as well as other nearby galaxies. 
Velocity range is truncated at 10,000 kms^^. 



available signals. However, one artifact that can be created 
by this process is the generation of negative side lobes north 
and south of particularly bright Hi sources, since actual Hi 
emission is included in the bandpass determination. Finally, 
spectra are smoothed with a Tukey filter to suppress Gibbs 
ringing, resulting in a velocity resolution of 18 kms~^. 

To create the three-dimensional position-position- 
velocity cubes that form the main data product of HiPASS, 
the bandpass-corrected spectra are gridded together using 
the package Gridzilla. The spectrum at each RA-Dec pixel 
is taken as the median of all data within 6 arcmin of the pixel 
centre, multiplied by 1.28 to correctly scale point source 
fiux densities. This procedure thus corrupts data for ex- 
tended sources in the standard HiPASS data, with peak flux 
densities"^ for such sources over-estimated (by 1.28 for an 
infinitely extended source). Extended sources nevertheless 
represent only a small fraction of overall sources (~2%, see 
Section 3.5) and the corresponding corrections are small. We 
therefore apply no correction to measured catalogue param- 
eters for this effect. The resulting cubes from the gridding 
process are 8° x 8 ° with 4' x 4' pixels in the on-sky direc- 
tions and extend over the full HiPASS velocity range in the 
third axis (an example cube is shown in Figure 1). Over- 



'peak flux density' is hereafter referred to as 'peak flux' 
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Table 3. Detection numbers at different stages of catalogue com- 
pilation 



Stage 



Number 



Generated by MuLTlFiND 137,060 

Generated by TopHat 17,232 

Examined in first two verification stages 142,276 

Examined in third verification stage 61,276 

Final catalogue 4,315 



lap regions between cubes vary, but in general are ~ 1° in 
RA and Dec. In total, 388 cubes are required to cover the 
southern sky. 

Lastly, a correction is applied to the spectra in each 
HiPASS data cube to minimize baseline distortion caused by 
the Sun and other continuum sources. This is applied using 
the program Luther and involves the fitting and subtrac- 
tion of a template distortion spectrum. 



3 THE CATALOGUE 

The HiPASS Catalogue (Hicat) is compiled using a com- 
bination of automatic and interactive processes, with can- 
didate detections first generated through automated finder 
scripts and then manually verified. Detections are finally 
parametrized using semi-automated routines. These are dis- 
cussed below; a summary of the number of detections re- 
maining at each stage is given in Table 3. 

3.1 Candidate Generation 

To generate candidate detections, the results of two separate 
scripts are combined. The first of these, a modified version 
of MuLTlFiND (Kilborn 2001), uses a peak fiux threshold 
method, while the second, TopHat, cross-correlates spec- 
tra with tophat profiles of various scales. The two methods 
are used together as neither is able to recover fully the real 
sources detected by the other (MultiFind locates ~83% of 
final catalogue sources, while TopHat finds ~90%). Both 
candidate lists are then merged and automatically cleaned 
to remove high velocity clouds (HVCs) and suspect detec- 
tions at known interference frequencies as detailed below. 

3.1.1 MultiFind 

The first of the finder algorithms is based on miriad (Sault 
et al. 1995) routines. Each of the velocity planes in a data 
cube is searched for detections rising above a specified peak 
fiux density and a Gaussian fitted to each pixel clump. For 
HiCAT, the peak flux threshold is set to four times the cal- 
culated noise. Cube noise is calculated robustly using the 
relation: 



cube noise : 



(1) 



where s is the median absolute deviation from the median. 
A median noise estimate is used to provide a method in- 
sensitive to outliers, with the applied conversion equivalent 
to estimating RMS of the normally distributed component 



of cube noise. Detections are filtered to include only those 
detections present in adjacent velocity planes with a sepa- 
ration < 5 arcmin. 

This detection process is repeated twice, each time Han- 
ning smoothing the data cube to increase detection signal- 
to-noise, and cube noise re-measured. Finally, detection lists 
are combined and duplicated detections removed. 



3.1.2 TopHat 

In the second finder algorithm, each spectrum in the cubes 
is searched for emission on a variety of velocity width scales 
between 1 and 40 channels. An initial list of detections is 
created as follows. First, solar and continuum ripple in the 
spectral baselines is reduced by using a moving median filter 
with a width dependent on the current search scale. Spectra 
are then cross-correlated with a top hat of the appropriate 
size, with the convolution weighted by the noise in each ve- 
locity plane. A feature is detected in a convolved spectrum 
if it rises above a threshold proportional to the interquartile 
range values in that spectrum. 

In the second stage of candidate identification, detec- 
tions from all scales are island grouped, listing together all 
detections that are either adjacent or overlapping (the ma- 
jority of features will be detected on multiple scales). The 
bounding box of these groups is then found, and the original 
data re-fitted to create the entry in the final detection list 
for each group. 



3.1.3 Cleaning 

Results from the automated finders are then merged and 
cleaned. First, all candidates with «gsr < 300 kms~^ are 
removed to avoid Galactic Hi emission and HVCs (HVCs in 
HiPASS are catalogued separately, see Putman et al. 2002). 
Additionally, an initial filter is applied to remove false detec- 
tions arising from hydrogen recombination lines and radio 
frequency interference. All detections having velocity widths 
< 25.0 kms~^ in the ranges 2,638 < Whd < 2,670 kms"\ 
4,355 < Uhoi < 4,397 kms-\ 4,505 < Whei < 4,559 kms'^ 
and 7,428 < Uhei < 7,482 kms~^ are removed. We note 
that many more recombination line and radio frequency in- 
terference detections are discarded in subsequent verifica- 
tion procedures than by this simple and conservative filter 
(see Sections 3.6 & 3.7 for further discussion). As no sources 
in the final catalogue have a velocity width < 25 kms~^, 
these cuts are not expected to have any effect on the final 
catalogue completeness. A declination limit of 5 < -1-2° is 
also applied. Following this process, ~ 140, 000 candidates 
remain to be manually checked. 



3.2 Candidate Verification 

Verification of candidate sources is a multi-stage process. 
First, two independent checks are made on each candidate, 
visually inspecting the full HiPASS spectrum summed from a 
12 X 12 arcmin box around the position of the detection. The 
aim of this process is to quickly remove obviously spurious 
detections without rejecting any real sources. 

Next, as a preliminary pass at removing multiple de- 
tections of the same object, candidates are grouped using 
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Table 4. HiPASS parameter descriptions. Unless otherwise stated, all velocities are in the heliocentric restframe and cz convention. 



Parameter 



Database 
Name 



Units 



Description 



Name 



hipass_najne 



RA 


ra 


hrs 


Dec 


dec 


deg 


,,max 
^50 


vel_50max 


kms^-"- 


j.min 


vel_50iiiin 


kms^-"- 


^.max 
^20 


vel_20max 


kms^-"- 


,,min 


vel_20iiiin 


kms^-"- 


^mom 


veljnom 


kms^ 


«Sp 


vel.Sp 


kms~ 


«gsr 


vel_gsr 


kms"-*^ 


^Ig 


vel.lg 


kms^-"^ 


■"cmb 


vel_cmb 


kms^-*- 


l^lo 


vel_lo 


kms^ 


''hi 


veljii 


kms^ 


"^spcclo 


vel.speclo 


kms"-*^ 


"^spcchi 


vel.spechi 


kms^ 


^mask 


veljnask 


kms^-"- 


Ti/max 


width_50max 


kms^-"- 


TT/min 


width_50mln 


kms^-"- 


ii/max 
•^20 


width_20max 


kms^-"- 


TT/min 

'^'^20 


width_20mln 


kms^-"- 


Sp 


Sp 


Jy 


Sint 


Sint 


Jy kms^^ 


RMS 


rms 


Jy 


RMSciip 


rms.cllp 


Jy 


RMScube 


rms.cube 


Jy 


cube 


cube 


- 


CF 


Sigma 


kms^-*- 


box size 


boxsize 


arcmin 


comment 


comment 


- 


follow-up 


nb_flg 


- 


confused 


cfsd_flg 


- 


extended 


extJlg 


- 



names are of the form HIPASS JXXXXitYY[a-z], where XXXX is the unrounded source 

RA in hrs and min, and YY is the unrounded source declination in degrees. An additional 

letter a-z is added if needed to distinguish sources. Names arc also made consistent with 

existing published sources where necessary. 

right ascension (J2000 hexadecimal format) 

declination (J2000 hexadecimal format) 

average of velocities at which profile reaches 50 per cent of peak flux density 

(using width maximization procedure," recommended velocity) 

average of velocities at which profile reaches 50 per cent of peak flux density 

(using width minimization procedure") 

average of velocities at which profile reaches 20 per cent of peak flux density 

(using width maximization procedure") 

average of velocities at which profile reaches 20 per cent of peak flux density 

(using width minimization procedure") 

flux weighted velocity average between v\q and v-^^ 

velocity at which profile peak flux occurs 

Galactic standard of rest velocity 

(converted from heliocentric Dmom using dc Vaucoulcurs et al. 1991) 

Local Group standard of rest velocity 

(converted from heliocentric i)mom using Karachentsev & Makarov 1996) 

Cosmic Microwave Background standard of rest velocity 

(converted from heliocentric Vmora using Fixsen et al. 1996) 

manually specified minimum profile velocity 

manually specified maximum profile velocity 

manually specified minimum velocity for spectral plots and RMS measurement 

manually specified maximum velocity for spectral plots and RMS measurement 

manually specified mask regions over which to interpolate baseline fitting and to exclude 

values from RMS measurement (in velocity pairs) 

difference of velocities at which profile reaches 50 per cent of peak flux density 

(using width maximization procedure") 

difference of velocities at which proflle reaches 50 per cent of peak flux density 

(using width minimization procedure") 

difference of velocities at which proflle reaches 20 per cent of peak flux density 

(using width maximization procedure") 

difference of velocities at which profile reaches 20 per cent of peak flux density 

(using width minimization procedure") 

peak flux density of profile 

integrated flux of source (within region diq, vi^\ and box size) 

RMS of masked spectrum between tJapcclo ^^d tigpechi 

clipped RMS of masked spectrum between tispeclo a-nd ^spcchi 

unsmoothed RMS of cube (median absolute deviation from median nmltiplied by ^■k/2) 

cube number 

standard deviation of Gaussian used in baseline smoothing 

box size used for parameter measurements 

comment (l=real, 2=have concerns) 

narrow band follow-up status flag (l=real) 

confused source flag (l=confused) 

extended source flag (l=extended). For extended sources 5int is summed flux within 

region rather than default point-source weighting. 



" Maximization procedure starts at profile velocity limits and searches inward until the required per cent of peak flux 
is reached. The minimization procedure starts at fs and searches outward. 



an angular separation limit of 5 arcmin and a velocity sep- 
aration of 100 kms^^. In this process a detection is added 
to an existing group if it is within the separation limits of 
any of its members. One detection from each group is then 
selected for further examination, giving preference first to 
those not rejected in both of first checks, then to the de- 
tection with the highest peak flux and flnally, in instances 
of equal peak flux, to the detection with the greatest veloc- 
ity width. Group size warning limits are set to 10 arcmin 
and 200 kms^^. Groups with sizes exceeding these warning 



limits are later inspected to ensure no nearby galaxies are 
removed (see Section 3.4). 

In the third stage of detection veriflcation, candidates 
not rejected by both of the flrst checks are examined in spec- 
tral, position, and position-velocity space. Third checks are 
carried out by one of only three selected observers to im- 
prove consistency. Positions are fitted using automatic iter- 
ative fitting routines based on original detection coordinates. 
To generate spectra, candidates are treated as unresolved. 
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Figure 2. Examples of baseline fitting using polynomials (left) and Gaussian smoothing (right). Upper polynomial fit uses a 5th order 
polynomial and lower fit uses 1st order polynomial. 
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Figure 3. Example plots examined during the final checking and parametrization processes: (top-left) RA-Dec moment map showing 
the default 28 X 28 arcmin data box used for position fitting; (top-right) spectrum within the 28 X 28 arcmin box, treating the detection 
as a point source and weighting pixels by Parkes' beam parameters - vertical lines show manually specified velocity limits within which 
profile parameters are measured and the RA-Dec moment map generated; (bottom) Dec-velocity moment map showing 28 arcmin box 
and profile velocity limits. 



with pixels in a 28 x 28 arcmin box used to calculate source 
flux in a given velocity plane via: 



5w-e4? 



(2) 



where Si is the flux at a given pixel (and velocity «), and fi 
is the value of the Gaussian beam at that pixel, relative to 
the previously fitted centre of emission. 

While every effort is made to either reject or confirm de- 
tections, any detections unable to be evaluated either way 
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Figure 4. Example spectrum following baseline fitting and re- 
moval showing the locations of Sp, W^^'^, W^^'', W^^'^, W^^"", 
Dgg'"', i'5q'", I'mom, ^lo s-iid Dhi • The parameters f^'"' and ijJq" 
are not shown but are defined similarly to v^^ and ^50°. The 
offset of «[nom toward higher velocities from v'^q^^ is due to the 
asymmetric nature of profile. 



are left in the catalogue at this stage. Spurious detections 
are flagged in the parametrization stage for follow-up obser- 
vations. 



parameter measurement, an ellipse is re-fitted to the new de- 
tection location, and the measurement region again moved 
to the ellipse position if the fit is successful. Plots of the 
detection are now generated in RA-Dec, Dec-velocity and 
spectral spaces, along with initial parameter measurements 
(see Figure 3). Spectra are Hanning smoothed for param- 
eter measurement to improve signal-to-noise, giving a final 
velocity resolution of 26.4 kms~^. Spectrum and profile lim- 
its, masked regions, initial fitting position and final position 
can all be varied using the interactive windows at this point. 
Checks can be made on the veracity of the detection by 
changing grey-scales, box size, Hanning smoothing and the 
velocity range displayed. The size of the Gaussian kernel 
used in baseline fitting can also be adjusted. For each ad- 
justment made, images and moment maps are regenerated, 
where appropriate, with fitting redone accordingly. When a 
satisfactory fit is achieved, the detection is given a final com- 
ment to confirm or reject it, with a follow-up flag available 
if necessary. By default, all fluxes are measured using the 
point source method, treating sources as non-extended with 
the central high signal-to-noise pixels preferentially weighted 
according to Parkes' beam parameters. The treatment of 
extended sources is discussed in Section 3.5. A full list of 
parameters measured is given in Table 4, along with an ex- 
ample spectrum illustrating various measured quantities in 
Figure 4. 



3.3 Parametrization 

The parametrization of HiCAT sources is done interactively 
using standard miriad routines, together with a modified 
version of mbspect that uses Gaussian smoothing for base- 
line fitting. All parametrization is done by M.M. and M.Z. 



3.3.1 Baseline Fitting 

Baseline fitting is carried out by first linearly interpolating 
the spectrum over the galaxy profile, as specified with the 
parameters vio and Vhi (see Table 4). The average values 
of the ten channels on either side of the masked profile are 
taken as the endpoints for the interpolation. The resulting 
spectrum is then convolved with a Gaussian of an interac- 
tively specified width. A Gaussian is used to provide the 
smoothest estimate of the spectral baseline. It is important 
in choosing the width of the Gaussian kernel to ensure that 
the scale is large enough to fit smoothly beneath the galaxy 
profile without creating spurious small scale variations. The 
Gaussian also needs to be wide enough not to reduce ar- 
tificially the noise of the observed spectrum. At the same 
time, the Gaussian should be narrow enough to fit the base- 
line without missing the peaks and troughs of the spectrum. 
Example spectra comparing Gaussian smoothing baseline 
fitting to polynomial baseline fitting are shown in Figure 2. 



3.3.2 Parameter Measurement 

To parametrize a detection, the position of the candidate 
is first refined by fitting an elliptical Gaussian to a 28 x 28 
arcmin moment map around galaxy finder coordinates. Pro- 
vided fitting is successful, the coordinates of the detection 
are taken as those of the ellipse and the 28 x 28 arcmin 
moment map regenerated. Entering an interactive loop for 



3.4 Confused Source Identification and Duplicate 
Removal 

To combine the lists from individual cubes, the full set of 
results is again island-grouped and cleaned to remove dupli- 
cate detections in the cube overlap regions. Accurate posi- 
tions enable much tighter matching limits, now chosen at 1.5 
arcmin and 20 kms~^. Tight limits also avoid accidentally 
removing real source pairs. 

As the scales of duplicate detections and confused 
source pairs overlap at this point, all galaxies with velocity 
profiles overlapping and on-sky position within 6a of each 
other (as defined by the gridded Parkes beamwidth) are ex- 
amined manually. Also included in this process are any de- 
tections previously fiagged as confused during parametriza- 
tion, as well as galaxies originating from cleaned groups that 
exceeded size warning limits in either of the island group- 
ing stages. Confused sources flagged in the final catalogue 
are either those manually identified at this stage, or detec- 
tions with velocity profiles overlapping and source separa- 
tions < 4(T. 

Sources that could reasonably be distinguished (ie. mea- 
sured fluxes largely due to the galaxy itself rather than a 
close neighbour), are identified and measured separately, 
whereas those for which this was not possible are grouped 
together into a single source and the detection manually 
fiagged as confused. 



3.5 Extended Sources 

The minimum expected number of extended sources in 
HiCAT can be roughly estimated using the relation between 
Hi mass and Hi diameter found for an optically selected 
sample of galaxies (Broeils & Rhee 1997): 



8 M.J. Meyer et al. 









40 


- 


-; 


_ 


- 


- 


m 






S 


■ 


, 


I ^° 


— 


] 


" 


cl 










1 
.a 

on 


■ 1 > 1 1 1 1 


1 1 1 r 1 1 1 1 1 ■ 


I 1 1 


1 



,0. 0.5 1 J. 5 2 

log[S.^, <i""i'>/Jy km s"'] 

Figure 5. Diflference between summed flux in 44 X 44 arcmin 
box and flux in 28 X 28 arcmin box weighted by Parkes beam pa- 
rameters assuming a point source. Values increase toward higher 
integrated fluxes as a result of extended sources. 



logiMni) = (1.96 ± 0.04) log{DHi) + (6.52 ± 0.06), (3) 

where Mhi is Hi mass in Mq and Dhi is the Hi diameter 
defined at a surface density of 1 Mqpc~^ in kpc. Combining 
this with the relation between Hi mass (in solar masses), 
distance (in Mpc) and observed flux J SdV (= Sint, in 
Jy kms~^): 



Mhi = 2.356 x 10^ D^ 



SdV 



(4) 



and making the approximation 1.961og(D) ~ log(I?^), this 
gives a relation for source size that depends solely on source 
flux and is independent of distance (9hi is the source diam- 
eter in arcminutes): 




5000 

Velocity (km/s) 




5000 

Velocity (km/s) 

Figure 6. Velocity distribution of HiPASS galaxies after the sec- 
ond verification check, with labels corresponding to potential RFI 
frequencies (top) and hydrogen recombination frequencies (bot- 
tom). 



SdV ^l.2ei 



(5) 



The relationship between / SdV and diameter can be 
empirically conflrmed by examining the difference between 
summed integrated flux in a 44 x 44 arcmin box and the 
standard non-extended source integrated flux in a 28 x 28 
arcmin box (pixels weighted by the Parkes beam parame- 
ters) . This difference is plotted as a function of the summed 
44 X 44 arcmin integrated flux in Figure 5. As expected, the 
average difference is close to zero at small integrated fluxes 
(point sources) and positive for large integrated fluxes (ex- 
tended sources), a positive value indicating that some flux 
was missed through use of the point-source method. The 
large amount of scatter in this graph is indicative of the un- 
certainties in measuring integrated fluxes by summing the 
value of all pixels in a given region, compared to the point 
source method. 

To identify extended sources in HiCAT, all sources 



greater than 7' in size are taken as potentially extended, 
this limit corresponding to 57 Jy kms"'^ from Equation 5. 
Assuming a uniform galaxy flux distribution, 93% of source 
flux is retrieved for a 7' diameter source if treated as un- 
resolved, and higher for a centrally biased distribution. In 
total, there are 188 candidate sources brighter than this 
flux limit, where flux is measured using an 44 x 44 arcmin 
'summed' box. Each source is examined manually, re-fitted 
and flagged if necessary. Where extended sources are con- 
fused or only marginaUy extended, detections remain fltted 
as point sources if this best removes erroneous flux contribu- 
tions from neighbouring galaxies or cube noise. In the flnal 
catalogue, 90 sources are flagged as extended. 

3.6 Interference 

To minimize the potential effects of interference, scans in 
HiPASS were taken in 5 separate sets, each 35 arcminutes 
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Figure 7. Example spectra and posit ion- velocity maps of GPS 
L3 beacon 
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Table 5. Hydrogen An=l to An=5 recombination lines in the 
range of HiPASS observations (Lilley & Palmer 1968) and their 
apparent Hi velocities. 

Transition Apparent Hi Velocity (kms^-*^) 



H166a 
H209b 
H239c 
H263d 
H283e 
H240c 
H210b 
H167a 
H284e 
H264d 
H285e 
H241c 
H265d 
H211b 
H168a 
H286e 
H266d 
H242c 



-911 
36 
438 
1434 
1540 
4199 
4340 
4507 
4718 
4857 
7918 
7991 
8306 
8685 
9990 
11140 
11781 
11815 



spectral nature of these checks and their deliberately conser- 
vative approach to candidate rejection. However, the exami- 
nation of position- velocity maps in the third stage of check- 
ing enables the removal of these false detections as shown 
by the final velocity distribution of the catalogue in Fig- 
ure 9. The time varying nature of the GPS signal, combined 
with the HIPASS observation and data processing meth- 
ods, results in a signal that is dominated by only a few on- 
sky pixels, much narrower than the signal of an unresolved 
source (see Figure 7). Similarly, narrow-line RFI can be dis- 
tinguished by the narrow-line signature spread across large 
spatial regions (Figure 8). 

The lack of features in the final velocity distribution at 
RFI frequencies shows that HiCAT completeness and relia- 
bility are not significantly degraded at these locations. This 
point is discussed further in Section 3.5 of Paper II. 



Figure 8. Example spectra and position- velocity maps of narrow- 
line interference 



apart, at well separated times. Nevertheless, some narrow- 
band interference remains in the data, the most promi- 
nent of which is a line at 1408 MHz (corresponding to 
-- 2640 kms"^) caused by the 11th harmonic of the 128 MHz 
multibeam correlator sampler clock (Barnes et al. 2001), and 
another line at 1400 MHz (corresponding to ~ 4370 kms^^) 
again caused by local interference. The presence of these, 
along with a number of other narrow interference lines, is 
clearly evident in the distribution of HiCAT candidates fol- 
lowing the first two checks of the cataloguing process (top 
panel. Figure 6). Also apparent in the velocity distribution 
are detections associated with the GPS L3 beacon at 1381 
MHz (--^ 8550 kms^^). These detections cover a range of 
velocities due to intrinsic signal spreading of the GPS bea- 
con. The presence of these interference detections following 
the first two verification checks is consistent with the purely 



3.7 Recombination lines 

Another source of potentially erroneous Hi detections in 
HiCAT is the recombination of warm ionized gas in the Milky 
Way. The full list of hydrogen An=l to An=5 recombination 
lines appearing in the HiPASS velocity range is given in Ta- 
ble 5 (taken from Lilley & Palmer 1968) and plotted against 
the distribution of detections following the first two verifica- 
tion checks in Figure 6). Given the increased line strengths 
of An=l transitions compared to those with higher prin- 
cipal quantum number differences, the erroneous detection 
spikes in Figure 6 most likely due to hydrogen recombination 
are H167a and H210b. These also coincide with known RFI 
frequencies. Once again, these lines are successfully removed 
in the subsequent verification stages as demonstrated by the 
final source velocity distribution. 
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3.8 Narrow-Band Follow-up 

As detailed in Paper II, an extensive program of Parkes 
narrow-band follow-up observations has been carried out to 
test the reliability of HiCAT. To additionally remove as many 
false detections as possible, low peak flux sources and those 
flagged as uncertain were preferentially targetted. In total, 
1082 HiCAT source were confirmed in this process and 119 
removed. Detections confirmed by narrow-band observations 
have been flagged as such in the final catalogue. 



4 DATA 

The full data of HiCAT is available on-line at: 

http://hipass.aus-vo.org 

This database is searchable in a number of ways, in- 
cluding by position and velocity. Returned parameters can 
be individually selected, along with any of the image prod- 
ucts, including detection spectra, on-sky moment maps and 
position- velocity moment maps. The format of the returned 
catalogue data can also be chosen, with both HTML and 
plain text available. An extract of the Hipass Catalogue is 
given in Table 6. 

We encourage other researchers to make use of this 
database. For optimum utility, researchers also need to be 
aware of the completeness, reliability and accuracy of the 
measured parameters. These are described in detail in Pa- 
per II. Users are also encouraged to be familiar with the full 
processing of HlPASS data (Barnes et al. 2001). 



5 BASIC PROPERTY DISTRIBUTIONS 

To show the basic properties of the HiPASS galaxies. Figure 9 
plots the bivariate distributions of velocity, velocity width, 
peak flux and integrated flux. Plotted along the diagonal are 
the one-dimensional histograms of each quantity. 

The large-scale structure of the local universe is clearly 
visible in the velocity distribution, with two prominent over- 
densities appearing at --^1,600 and --^2,800 kms~^. This dia- 
gram also shows the noise- limited nature of the survey, with 
no sharp cutoff in galaxy numbers observed at the highest 
observed velocities. From the peak flux distribution, it is 
clear that most galaxies are close to the detection limit of 
the survey, with the distribution peaking at ~50 mjy. 

The sharp decline at the upper end of the galaxy mass 
function can be seen by examining the peak flux- velocity and 
integrated flux-velocity distributions. This effective upper 
limit to Hi mass results in galaxies at high distances all 
having low peak and integrated fluxes. The steepness of the 
decline is apparent in the relatively sharp boundary between 
the fiux-velocity regions in which galaxies are, and are not, 
observed. The low space density of the most massive galaxies 
is also visible in the width-velocity distribution, with small- 
velocity-width galaxies preferentially found nearby due to 
the small volumes surveyed. 



6 LARGE SCALE STRUCTURE 

Figures 10 and 11 plot the equatorial coordinate polar and 
Aitoff projections of the full on-sky distribution of HiCAT 
sources respectively. Points are shaded according to the ob- 
served recessional velocities. From these plots, the inho- 
mogeneous distribution of HI sources in the local universe 
is clearly apparent. Most obvious are the Super-Galactic 
plane (marked with a dark solid line), the Fornax wall 
(comprising the over- densities around Dorado, Fornax and 
Eridanus; passes through RA=3.5 hrs, Dec=— 30°), and a 
third filament extending from near the Super-Galactic plane, 
through Antlia, Puppis and Lepus (passes through RA=8 
hrs, Dec=— 30°). Also notable are significantly under-dense 
regions, the most prominent of which is the Local Void 
(around RA=18 hrs, Dcc=— 30°). The location of large-scale 
structures is important when determining the representative 
nature of galaxy surveys given their search region. 

HiCAT also maps structures behind the Milky Way, with 
relatively little obscuration apparent in the observed distri- 
bution. Many of these structures and filaments have been 
previously identified (Kraan-Korteweg et al. 2000; Henning 
et al. 1998), however their uninterrupted path through the 
Zone of Avoidance is traced out here for the first time in a 
single survey. 

In Figures 12 and 13, the velocity distribution of HiCAT 
galaxies is plotted in two different projection sets. In Fig- 
ure 12, the southern sky hemisphere is divided like segments 
of an orange into six wedges, each with a common base axis 
along the line RA=0-12 in the equatorial plane. The loca- 
tion of these wedges with respect to the southern celestial 
hemisphere is shown in Figure 14. The shape of these regions 
results in volume increasing in the vertical rather than ra- 
dial direction. Thus, more galaxies are present in the central 
horizontal regions when compared to similar velocity regions 
further out. Figure 13 also divides the southern hemisphere 
into six wedges, this time using a common base axis along 
RA=6-18 in the equatorial plane. 

Those figures again show the Super-Galactic plane and 
the Local Void. Figure 12(d) contains the bulk of the Super- 
Galactic plane in face-on projection and Figures 13(c)-(e) 
show the plane in edge-on cross sections. From this sec- 
ond set of diagrams it is also clear that local regions of the 
plane are in fact comprised of two main parallel components. 
The lower right regions of these diagrams show the signif- 
icantly under-dense region of the Local Void. Figure 12(f) 
most clearly shows this in the orthogonal projection in the 
centre of the diagram. Also marked in the velocity diagrams 
are the locations of four major nearby clusters (F=Fornax, 
C=Centaurus, H=Hydra and N=Norma), their locations co- 
inciding with over-dense regions in the HiCAT data. 



7 SUMMARY 

HiPASS is the first blind Hi survey of the entire southern 
sky. Using the Multibeam receiver on the Parkes Telescope, 
this survey covers 5 < -1-25° with a velocity range —1,280 to 
12,700 kms^^. Mean noise for the survey is 13 mJy beam^^. 
Taking data from the southern region of this survey S < -1-2° , 
we have compiled a database of 4,315 sources selected purely 
on their Hi content. Candidates are found using automatic 



Table 6. Extract of the HiPASS Catalogue. The catalogue is presented in its entirety in the electronic version of this journal. Parameter descriptions are given in Table 4. 



Name 



RA 

(hrs) 

^speclo 

(kms^-'-) 
(kms^-'-) 



Dec 

(deg) 

^spcchi 

(kms^-'-) 
box size 
(arcmin) 



(kms -*-) 

^mask 

(kms^-"-) 
comment 



''so 
(kms^-*-) 

(kms^-"-) 
follow-up 



(kms ^) (kms ■'-) 

^w'" ^20'''' 

(kms^-"-) (kms^-^) 

confused extended 



(km s ■'- ) 

W2T"_ 
(km s ^) 



(kms^-'-) 

(Jy) 



(kms^-"-) 

•S'int 

(Jy kms^-^) 



(kms^-'-) (kms^-"-) (kms^-'-) (kms^-"-) 
RMS RMSciip RMScubo cube 

(Jy) (Jy) (Jy) 



HIPASSJOOOO-07 



HIPASSJOOOO-40 



HIPASSJ0002-03 



HIPASSJ0002-07 



HIPASSJ0002-15 



HIPASSJ0002-52 



HIPASSJ0002-80 



HIPASSJ0004-01 



HIPASSJ0005-07 



HIPASSJ0005-11 



00:00:25.8 

2210.7 

158 

00:00:32.3 

1617.3 

158 

00:02:00.5 

4503.7 

158 

00:02:03.7 

2307.5 

158 

00:02:29.5 

1878.9 

158 

00:02:29.7 

200.1 

158 

00:02:57.4 

493.7 

158 

00:04:46.5 

5608.2 

158 

00:05:12.3 

2204.2 

158 

00:05:15.6 

5239.8 

158 



-07:49:56 

5284.0 

28 

-40:29:54 

4733.8 

28 

-03:17:01 

7540.8 

28 

-07:37:56 

5240.8 

28 

-15:58:25 

4971.3 

28 

-52:47:15 

3030.3 

28 

-80:20:51 

3424.0 

28 

-01:35:50 

8795.5 

28 

-07:03:43 

5432.7 

28 

-11:29:37 

8263.9 

28 



3747.8 

3619,3807 

1 

3170.8 

2993,3342 

1 

6001.8 

5918,6096 

1 

3764.8 

3715,3803 

1 

3416.2 

3328,3532 

1 

1500.2 

1416,1578 

1 

1958.3 

1861,2060 

1 

7194.1 

7024,7386 

1 

3810.0 

3628,4046 

1 

6762.0 

6653,6911 

1 



3747.8 

58.1 

yes 

3170.8 

239.1 

yes 

6001.8 

126.4 

no 

3764.8 

44.7 

yes 

3416.2 

93.6 

yes 

1500.2 

111.4 

yes 

1958.3 

98.7 

no 

7194.1 

221.4 

no 

3932.9 

314.8 

yes 

6762.0 

132.7 

yes 



3721.8 

58.1 

no 

3169.6 

239.1 

no 

6005.3 

126.4 

no 

3764.0 

44.7 

no 

3420.8 

93.6 

no 

1498.0 

111.4 

no 

1961.3 

98.7 

no 

7186.9 

221.4 

no 

3816.1 

69.1 

no 

6781.3 

132.7 

no 



3721.8 

124.9 

no 

3169.6 

258.2 

no 

6005.3 

152.5 

no 

3764.0 

65.8 

no 

3420.8 

147.1 

no 

1498.0 

138.1 

no 

1961.3 

132.8 

no 

7186.9 

309.4 

no 

3816.1 

346.6 

no 

6756.0 

229.5 

no 



3728.2 
124.9 

3167.9 
258.2 

6002.0 

152.5 

3765.2 
65.8 

3422.6 
147.1 

1499.7 
138.1 

1959.2 
132.8 

7186.7 
309.4 

3817.7 
346.6 

6768.7 
179.0 



3759.8 
0.039 

3071.6 
0.066 

5966.7 
0.055 

3773.3 
0.048 

3435.6 
0.039 

1504.6 
0.051 

1958.3 
0.242 

7109.8 
0.046 

3949.3 
0.086 

6750.8 
0.039 



3811.7 
2.7 

3138.1 
12.0 

6099.1 
6.8 

3848.5 
2.2 

3478.1 
3.7 

1427.3 
5.6 

1807.4 
23.4 

7287.7 
9.4 

3901.3 
18.0 

6837.9 
5.2 



3870.1 
0.0065 


3377.6 
0.0056 


3619.0 
0.0115 


3807.3 
336 


3149.8 
0.0081 


2923.4 
0.0063 


2993.2 
0.0115 


3342.1 
146 


6162.8 
0.0064 


5645.6 
0.0056 


5917.5 
0.0115 


6095.7 
337 


3907.1 
0.0073 


3414.8 
0.0069 


3714.6 
0.0122 


3803.4 
286 


3526.0 
0.0082 


3089.5 
0.0069 


3327.8 
0.0115 


3532.2 
237 


1419.4 
0.0064 


1318.3 
0.0055 


1416.3 
0.0115 


1578.0 
107 


1758.6 
0.0090 


1943.4 
0.0070 


1860.9 
0.0122 


2060.4 
9 


7353.2 
0.0071 


6829.6 
0.0067 


7023.7 
0.0115 


7386.3 
337 


3960.5 
0.0072 


3467.6 
0.0060 


3627.7 
0.0122 


4045.7 
286 


6891.6 
0.0072 


6426.5 
0.0065 


6652.8 
0.0122 


6911.0 
286 
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Figure 9. Log-log bivariate distributions of observed parameters: velocity {v^f"^), velocity width {W^^'^^), peak flux (Sp) and integrated 
flux (Sint)- Single parameter distributions are also given along the diagonal. 



finder algorithms, before being subjected to an extensive 
process of manual checking and verification. Issues examined 
in compilation of the catalogue include RFI and recombina- 
tion lines, along with additional processes for confused and 
extended sources. A large program of follow-up observations 
is also used to confirm catalogue sources. Completeness, re- 
liability and parameter accuracy of HiCAT sources is ad- 
dressed in Paper II. Optical counterparts to HiPASS galaxies 
are examined in Paper III. 

The large-scale distribution of sources in the catalogue 
covers a number of major local structures, including the 
Super-Galactic plane along with filaments passing through 
the Fornax and Puppis regions. Also notable is the probing 



of structure in the Zone of Avoidance, a region significantly 
obscured in optical bands. The survey also surveys a number 
of under-dense local regions including the Local Void. 



This database offers a unique opportunity to study a 
number of important astrophysical issues. Future papers will 
address topics such the cosmic mass density of cool baryons, 
the properties of Hi selected galaxies at other wavelengths, 
the comparative clustering of Hl-rich galaxies, the TuUy- 
Fisher relation, the role of hydrogen in galaxy group evolu- 
tion and the study of Hl-rich galaxy environments. 
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Figure 10. Polar projection of HiCAT detections. Objects are colour-coded by velocity as noted in the key. Coordinate systems are 
marked as follows: light solid lines - equatorial coordinates; dotted lines - galactic coordinates; dark solid line - super-galactic plane. 
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Figure 12. Velocity distribution of HiCAT sources divided into segments all hinged on the equatorial plane at the 0-12 hour line starting 
with segment (a) with one side lying in that plane centred at RA=6hr, (b) above that and so on with (f) again with one side on the 
plane, but centred at RA=18hr. The location of these wedges in the southern sky is shown in Figure 14. 
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Figure 13. Velocity distribution of HiCAT sources divided into segments all hinged on the equatorial plane at the 6-18 hour line starting 
with segment (a) with one side lying in that plane centred at RA=Ohr, (b) above that and so on with (f) again with one side in the 
plane, but centred at RA=12hr. 
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Figure 14. Location in the southern sky of the velocity distribution wedges shown in Figure 12. 



